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As a below-named inventor, I hereby declare that: \^ i^l 

My residence, post office address and citizenship are as stated beIow^J5e^{j^g;^wmie 



I BELIEVE I AM THE ORIGINAL, FIRST AND SOLE INVENTOR (if only one name is listed below) OR AN 
ORIGINAL, FIRST AND JOINT INVENTOR (if more than one name is listed below) OF THE SUBJECT MATTER 
WHICH IS CLAIMED AND FOR WHICH A PATENT IS SOUGHT ON THE INVENTION ENTITLED: 

Hierarchical Storage Architecture for Reconfigurable Logic Configurations 



the specification of which 

(check one) is attached hereto; 

El was filed on October 3, 2000 as 

Application No. 09/679,398 

and was amended on ; 

(if applicable) 



I HAVE REVIEWED AND UNDERSTAND THE CONTENTS OF THE ABOVE-IDENTIFIED SPECIFICATION, 
INCLUDING THE CLAIMS, AS AMENDED BY ANY AMENDMENT REFERRED TO ABOVE; 

I ACKNOWLEDGE THE DUTY TO DISCLOSE TO THE OFFICE ALL INFORMATION KNOWN TO ME TO BE 
MATERIAL TO PATENTABILITY AS DEFINED IN TITLE 37, CODE OF FEDERAL REGULATIONS, Sec. 1.56 
(as amended effective March 16, 1992); 

I do not know and do not believe the said invention was ever known or used in the United States of America before my 
or our invention thereof, or patented or described in any printed publication in any country before my or our invention 
thereof or more than one year prior to said application; that said invention was not in public use or on sale in the 
United States of America more than one year prior to said application; that said invention has not been patented or 
made the subject of an inventor's certificate issued before the date of said application in any country foreign to the 
United States of America on any application filed by me or my legal representatives or assigns more than twelve 
months prior to said application; 

I hereby claim foreign priority benefits under Title 35, United States Code Sec, 119 and/or Sec. 365 of any foreign 
application(s) for patent or inventor's certificate as indicated below and have also identified below any foreign 
application for patent or inventor's certificate on this invention having a filing date before that of the application(s) on 
which priority is claimed: 
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and: Joseph P. O'Malley, Reg. No. 36,226 



Address all correspondence to: 



Robert E. Krebs 

BURNS, DoANE, Swecker & Mathis, L.L.P. 
P.O, Box 1404 

Alexandria, Virginia 22313-1404 



Address all telephone calls to: Joseph P. O'Malley 



at (650)622-2300. 



I hereby declare that all statements made herein of my own knowledge are true and that all statements made on information 
and belief are believed to be true; and further that these statements were made with the knowledge that willful false state- 
ments and the like so made are punishable by fine or imprisonment, or both, under Section 1001 of Title 18 of the United 
States Code and that such willful false statements may jeopardize the validity of the application or any patent issued thereon. 
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POST OFFICE ADDRESS 
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FULL NAME OF SECOND JOINT INVENTOR, IF ANY 
DALE WONG 
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DATE 
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855 35* Avenue. San Francisco, CA 94121 


CITIZENSHIP 

United States of America 


POST OFFICE ADDRESS 

855 35* Avenue, San Francisco, CA 94121 
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Attorney's Docket No. 

ASSIGNMENT 
(JOINT) 



THIS ASSIGNMENT, by CHRISTOPHER E. PHILLIPS , and DALE WONG , residing at 
5888 Assisi Court, San Jose, California 95138-2314 and 855 35^*^ Avenue, S an Francisco, 
California 941 21 (hereinafter referred to as "the Assignors"), respectively, witnesseth: 

WHEREAS, the Assignors have invented certain new and useful improvements in 
HIERARCHICAL STORAGE ARCHITECTURE FOR RECONFIGURABLE LOGIC CONFIGURATIONS , 
set forth in an application for Letters Patent of the United States, 

(1) □ which Is a provisional application 

(a) □ to be filed herewith; or 

(b) □ bearing Application No. and filed on or 

(2) H which is a non-provisional application 

(a) □ having an oath or declaration executed on even date herewith prior 
to filing of application; 

(b) H bearing Application No. 09/679,398 , and filed on October 3, 2000; 
or 

(c) □ to be filed; and 

WHEREAS, CHAMELEON SYSTEMS, INC.. . a corporation duly organized under and 
pursuant to the laws of the State of California and having its principal place of business at 1 61 
NORTECH Parkway, San Jose. CA 951 34 (hereinafter referred to as "the Assignee"), is desirous of 
acquiring the entire right, title, and interest in and to said inventions, the right to file applications 
on said inventions .and the entire right, title and interest in and to any applications, including 
provisional applications for Letters Patent of the United States or other countries claiming priority 
to said application, and in and to any Letters Patent or Patents, United States or foreign, to be 
obtained therefor and thereon. 

NOW, THEREFORE, in consideration of One Dollar ($1 .00) and other good and sufficient 
consideration, the receipt of which is hereby acknowledged, the Assignors have sold, assigned, 
transferred, and set over, and by these presents do sell, assign, transfer, and set over, unto the 
Assignee, its successors, legal representatives, and assigns the entire right, title, and interest in and 
to the above-mentioned inventions, the right to file applications on said inventions and the entire 
right, title and interest in and to any applications for Letters Patent of the United States or other 
countries claiming priority to said applications, and any and all Letters Patent or Patents of the 
United States of America and all foreign countries that may be granted therefor and thereon, and 
in and to any and all applications claiming priority to said applications, divisions, continuations, and 
continuations-in-part of said applications, and reissues and extensions of said Letters Patent or 
Patents, and all rights under the International Convention for the Protection of Industrial Property, 
the same to be held and enjoyed by the Assignee, for its own use and behalf and the use and behalf 
of its successors, legal representatives, and assigns, to the full end of the term or terms for which 
Letters Patent or Patents may be granted as fully and entirely as the same would have been held 
and enjoyed by the Assignors had this sale and assignment not been made; 

AND for the same consideration, the Assignors hereby covenant and agree to and with 
the Assignee, its successors, legal representatives, and assigns, that, at the time of execution and 
delivery of these presents, the Assignors are the sole and lawful owners of the entire right, title, and 
interest in and to the inventions set forth in said applications and said applications, including 
provisional applications, above-mentioned, and that the same are unencumbered, and that the 
Assignors have good and full right and lawful authority to sell and convey the same in the manner 
herein set forth; 
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AND for the same consideration, the Assignors hereby covenant and agree to and with 
the Assignee, its successors, legal representatives, and assigns that the Assignors will, whenever 
counsel of the Assignee, or the counsel of its successors, legal representatives, and assigns, shall 
advise that any proceeding in connection with said inventions or said applications for Letters Patent 
or Patents, or any proceeding in connection with Letters Patent or Patents for said inventions in any 
country, including interference proceedings, is lawful and desirable, or that any application claiming 
priority to said application, division, continuation, or continuation-in-part of any applications for 
Letters Patent or Patents, or any reissue or extension of any Letters Patent or Patents to be 
obtained thereon, is lawful and desirable, sign all papers and documents, take all lawful oaths, and 
do all acts necessary or required to be done for the procurement, maintenance, enforcement, and 
defense of Letters Patent or Patents for said inventions, without charge to the Assignee, its 
successors, legal representatives, and assigns, but at the cost and expense of the Assignee, its 
successors, legal representatives, and assigns; 

AND the Assignors hereby authorize and request the attorneys of BURNS, DOANE, 
SwECKER & Mathis, L.L.P. of Alexandria, Virginia to insert in the spaces provided above the filing 
date, application number, and attorney docket number of said application when known. 

AND the Assignors hereby request the Commissioner of Patents to issue any and all 
said Letters Patent of the United States to the Assignee as the Assignee of said inventions, the 
Letters Patent to be issued for the sole use and behalf of the Assignee, its successors, legal 
representatives, and assigns. 



Date Signature of Assignor 



Christopher E. Phillips 



Date Signature of Assignor 



Dale Wong 
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"Express Mail" Mailing Label No. EL 521 508 415 US 

Date of Deposit October 3. 2000 

I hereby certify that this paper or fee is being deposited with the United 
States Postal Service "Express Mail Post'OffKRo Addressee" service under 37 
CFR 1.10 on the date indicated above, with sufficient postage affixed, and is 
addressed to BOX PATENT APPLICATION. Assistant Commissioner for 
Patents, Washington, D.C. 2023 1 . 

SeyaiMj^tangii 

(Typed or print^^aifie ot^son. mailing paper or fee) 

HDDERARCfflGAL STORAGE ARCBEITECTURE FOR 
RECONFIGURABLE LOGIC CONFIGURATIONS 



ABSTRACT OF THE DISCLOSURE 

1. Field of the Invention 

The present invention relates to reconfigurable computing. 

2. State of the Art 

A Field Programmable Gate Array (FPGA) is a single-chip combination of 
computing elements and storage elements. The computing elements can be 
configured to implement different logic functions depending on the values stored in 
the storage elements. A collection of such values that can configure all the 
computing elements on the chip will be referred to as a "configuration plane". A 
collection of values that is a subset of a plane will be referred to as a 
"configuration. " 

In a conventional FPGA, there is only enough on-chip storage for a single 
configuration plane. In a variant of FPGAs known as Reconfigurable Logic, there 
may be enough on-chip storage for multiple configuration planes. In reconfigurable 
logic there is typically some mechanism for rapidly changing which plane is 
currently configuring the computing elements. In addition, there is typically some 
mechanism for loading the multiple planes from off-chip storage, which can result 
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in virtually unlimited configurations for the chip. However, the time required to 
load die off-chip configuration data is the botdeneck for current implementations. 

The off-chip loading is typically handle^y either a caching or a pre-fetch 
strategy. In a caching strategy, an on-chip cache of the most recendy used 
configurations is stored, and in the event of a cache miss, die chip is stalled until die 
configuration can be loaded from off-chip. This is a delay of several hundreds of 
clock cycles for the current generation of reconfigurable logic. In a pre-fetch 
strategy, the overall schedule of configuration invocations is analyzed and the 
appropriate configurations are loaded into the configuration planes before they are 
needed, ideally avoiding stalling the chip. However, the more time required to load 
an off-chip configuration, the more branching in the configuration schedule will be 
encountered between the pre-fetch and the actual use, possibly invalidating the 
original pre-fetch decision and stalling the chip. 

SUMMARY OF THE INVENTION 
The present invention, generally speaking, provides a hierarchy of 
configuration storage. The highest level of the hierarchy is an active configuration 
store; the lowest level is an off-chip configuration store; in between are one or more 
levels of configuration stores. Every configuration is promoted from the lowest 
off-chip level, through each level, up to the highest active level. Each ascending 
level of the hierarchy has a decreasing latency time required to promote a 
configuration to the next higher level of the hierarchy, and a decreasing amount of 
available storage. This separation into levels allows the amount of available storage 
to be adjusted depending on the inherent latency of the levers storage mechanism, 
where a longer latency requires a larger cache. This in turn allows the total 
required storage for a given performance level to be minimized. 
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BRIEF DESCRIPTION OF THE DRAWING 
The present invention may be fiirther understood from the following 
description in conjunction with the appended4^^ing-. In the drawing: 

Figure 1 is a diagram of an exemplary configuration storage hierarchy; 

5 ■ 

Figure 2a is a simplified example of a configuration to be compressed; 

Figure 2b is a compressed format used to represent the configuration of 
Figure 2a, the bits of the representation being further compressed; 

10 

Figure 3 is a diagram showing an example of a suitable on-chip cache; 

Figure 4 is a diagram showing an example of decompression; 

15 Figure 5 is a diagram showing an example a of planes/configuration table; 

Figure 6a is a block diagram of a portion of a memory plane stack; 

Figure 6b is a diagram of a group of corresponding memory cells, one cell 
20 form each plane of the memory stack of Figure 6a; 

Figure 6c is a diagram of an alternative embodiment of the memory stack of 
Figure 6a in which separate "ftinction" and "wire" stacks are provided; 

25 Figure 6d is a diagram of separate memory stacks provided for control, 

datapath and memory configuration, respectively; 

Figure 6e is a diagram of a common memory stack provided for control, 
datapath and memory configuration; and 

30 

Figure 7 is a schematic diagram of an alternative embodiment for a single bit 
of the memory stack of Figure 6a. 



35 



- 3 - 



PATENT 

ATTORNEY'S DOCKET NO. 032001-011 



DETAILED DESCRIPTION OF THE PREFERRED EMBODIMENTS 
Referring now tu Figure 1 , in a preferred embodiment of the present 
invention, an FPGA or reconfigurable logic-detiice is-provided with a configuration 
storage hierarchy having multiple levels, e.g., four levels: 1) off-chip storage, 2) 
compressed cache storage, 3) decompressed configuration planes, and 4) one or 
more active configuration planes. A description of each level follows, proceeding 
from lowest to highest level. 

Off-Chip 

The off-chip level of storage may be implemented in a variety of 
technologies, including without limitation EEPROM, RAM, hard drive, or I/O port. 
Preferably, the external storage device is memory mapped (corresponds to address 
entry in system CPU memory access space), and an instruction to load a specific 
configuration from off-chip storage device will include the configuration's starting 
address and length. The length of a configuration will vary depending on: how 
many computing elements it configures, the specific function for each computing 
element, and the amount of compression achieved. 

A configuration may include an arbitrary number of computing and/or 
routing elements. Nor is there any restriction that the elements be contiguous on the 
chip. Partial reconfiguration may be used to support a "data-in-place" computing 
style where some computing elements configured as registers and holding active 
data are left untouched, while other computing elements are reconfigured to perform 
new functions on the data. Partial reconfiguration may be performed. For "data in 
place," storage contents are left in place at either/or register and local memory 
elements. The control logic or wiring interconnectivity can be updated with new 
certification data while the rest of the configuration data fields for the storage 
remains unchanged. In a preferred embodiment, routing between elements can 
remain static while the control codes are updated. In both of these cases, selected 
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subsets of configurations are used, resulting in effective benefits of partial 
fccoufiguration. 

The off-chip configurations are stored ifee^ compressed format. One possible 
compression scheme is described here. Referring to Figure 2a, the computing 
elements on the chip are in a two dimensional X and Y array. A computing element 
is configured by storing an opcode (e.g., 1, 3, 7, etc.) in the computing element. 
Routing elements occupy rows and columns where all elements in a row have the 
same Y coordinate, and all elements in a column have the same X coordinate. 
Referring to Figure 2b, a single configuration consists of a series of instructions, to 
be executed in sequence, all with the following three-field format: Y control, X 
control, opcode. The Y control is a binary number from 0 to N-1, where N is the 
maximum possible Y coordinate. The X control is an N bit wide word, where N is 
the maximum possible X coordinate. In other words, the X control has one bit for 
every column, and the Y control is decoded for each row. In the row enabled by the 
Y control, for each element in the row where the corresponding X control bit is a 1 , 
the specified opcode will be loaded into the element. On top of this "common 
configuration" compression, the entire configuration (sequence of instructions) may 
be bit-wise run length compressed. 

In an alternative implementation, the Y control may not be encoded if the 
savings from simultaneously loading multiple rows with the same opcode outweighs 
the savings from encoding the row coordinate. 

. In addition to being compressed, the configurations may also be encrypted. 

The number of bits used to configure a single element may vary. It is 
possible to apply, for example, Huffman encoding to the set of possible 
configuration codes so that the more frequently used codes require fewer bits than 
the less frequently used codes. Even if a fixed bit-width is used for the opcode, 
maximizing the number of leading zeros will help in a run length compression 
scheme. 
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Compressed Cache 

The on-chip coiripicssed cache can be ioaded directly from the off-chip 
configurations. The on-chip cache has its owEc^edieated DMA server. The 
configurations are loaded directly from off-chip without any modification, in 
5 compressed format. As a resuU, more configurations can be stored in a given 

amount of cache, and the off-chip loading time is minimized. 

Referring to Figure 3, one possible implementation of the cache is as 
follows. The on-chip compressed cache may be implemented as a RAM witii 
multiple cache "lines", where each line consists of a configuration field, a contents 

10 addressable field, and a tagged bit field. The contents addressable field will store 

the address of the configuration, which is the same as the off-chip address used to 
load the configuration. The tagged bit field is used during a search of the cache for 
a given configuration. The tag bit is set to TRUE for any line with an address field 
that exactiy matches a searched for address, arid is set to FALSE otherwise. 

15 Whenever a configuration is loaded into the cache, a search is performed first to 

check if there is already a line with the same address. If so, the off-chip 
configuration is loaded on top of the existing line in the cache. If not, the first 
available line is used. A separate counter with wrap around is maintained to 
indicate the first available line. If the first available line's address field is not equal 

20 to zero, an error flag is raised. When a line in the cache is freed, its address field is 

set to zero. Instead of a wrap around counter, an alternate method for identifying 
an available line is to search for a zero address and use the first available. 

Decompressed Planes 
25 The decompressed planes are loaded with configurations from ttie 

compressed cache, with stream-oriented decompression and decoding. Once they 
are in the decompressed planes, configurations can be moved into the active plane in 
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as little as a single clock cycle. The decompressed planes serve as the rapid staging 
area for the active plane. 

Referring to Figure 4, one possibleimpi^entation of the decompression and 
decoding process is as follows. A fixed bit- width is assumed for the length field of 
the run-length compressed bitstream. The length field value is loaded into a count 
down counter. The next bit is shifted into a shift register until the counter reaches 
zero or the register is filled. The bit-width of the register corresponds to the length 
of a single configuration instruction. The instruction's X, Y, and opcode fields \yill 
have been zero-filled so that the fields are always the same bit-width. When the 
register is filled, the fields will drive the loading of the decompressed plane 
accordingly. The process continues until a length field of zero is encountered. 

If the configuration instructions are encrypted, they will be decrypted after 
each configuration instruction is decompressed. In this case, local hardware would 
intervene to perform the decryption before the disbursement in the configurable 
storage planes. 

Referring to Figure 5, a separate table is maintained that stores the address 
of the configuration that is currentiy loaded in each decompressed plane. While the 
chip is executing, this table can be used to verify that the intended configurations 
have actually been pre-fetched and are still resident in the planes. This table can 
also be used to save and restore the state of the chip in the event of an interrupt. 
This table can also be used to boot some initial configurations into the chip during 
power-up. 

Active Plane 

The active plane can be loaded from any of the decompressed planes. A 
particular embodiment of a memory plane stack 1200 is shown in Figure 6a. In die 
illustrated example, the top two planes 1206, 1205 of the memory plane stack are 
configuration planes. Configuration data stored in these planes is applied to the 
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reconfigurable logic. In the illustrated embodiment, "function" configuration data 
and "wire" configuration data is stored in different planes. The bottom memory 
plane 1200a provides external access to the mes^ry -stack. Intermediate planes 
function, for example, as a configuration stack, storing configurations expected to 
be used by not presently active. In an exemplary embodiment, memory plane 0 is 
single port, for single-channel read and write between system memory and 
configuration storage. The remaining memory planes are dual port, having one read 
port and one write port. Dual port supports simultaneous loading and recirculation 
of configuration data with the local "stack." If no data compression is used, then 
simultaneous real-time monitoring is possible, e.g., by writing out a "snapshot" of 
one or more planes of the stack. 

A group of corresponding memory cells, one cell from each plane of the 
memory stack, is shown in Figure 6b, The ports of all of the cells are 
interconnected so as to allow an operation in which the contents of a cell within any 
plane may be read and then written to the corresponding cell of any other plane. For 
example, by activating the appropriate control signal, the contents of plane 4 may be 
read and written into plane 6. Such an operation may be accomplished, preferably, 
in a single clock cycle, or at most a few clock cycles. Configuration data is loaded 
from external main memory into plane 0 of the memory stack in anticipation of its 
being transferred into a configuration plane. 

Alternatively, separate "function" and "wire" stacks may be provided, as 
shown in Figure 6c. Using this arrangement, function and wire configurations may 
be changed simultaneously. Similarly, configuration stacks for configuration of 
control, datapath and memory may be combined (Figure 6d) or separate (Figure 
6e). 

A schematic diagram of an alternative embodiment of a cell stack is shown 
in Figure 7, showing a cross section of several configuration planes 1301-1304 and 
the lockable fabric-definition cell 1305 that produces a Fabric_Define_Data bit for a 
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single bit location. These bits are aggregated in order to form sufficient bit numbers 
for functional cell type definition. For instance, a four bit grouping might designate 
between four to sixteen different cell type defiskions. The other latch sites below 
the storage cell are for additional configuration plane data available for swapping as 
needed by functional scheduling requirements. These storage locations can be 
written and read to from a common configuration data bus structure. The 
Config Read Data and Config_Load_Data buses 1307 and 1309, although shown as 
being separate, can be combined as a single bi-directional bus for wiring efficiency. 
This bus structure allows configuration data to be written as needed. The 
Swap Read Plane buffer 1311 allows existing configuration plane data contents to 
be swapped among differing configuration planes on a selectable basis. For instance, 
the current operation plane of data can be loaded from configuration plane 1 to 
configuration plane 2 by the use of the Swap_Read_Plane buffer 1311. The structure 
shown in Figure 7 is similar to a conventional SRAM memory structure which 
allows a dense VLSI circuitry implementation using standard memory compiler 
technology. This structure could also be implemented as a conventional dual port 
RAM structure (not shown) which would allow for concurrent operation of the write 
and read data operations. Unlike Figure 6b, the example of Figure 7 assumes 
separate configuration stacks for each configuration plane as described hereinafter. 
That is, the bit stack produces only a single Fabric_Define_Data bit instead of 
multiple fabric definition data bits as in Figure 6b. The bits could also be extended 
to include registers operating in a like fashion. 

If the Data_Recirc_Read line 1313 is also connected to data storage locations 
that are used for normal circuit register operation, then real time monitoring of 
device operations can be utilized by the operating system for applications such as 
RMON in internetworking application area or for real time debug capability. The 
RMON application basically uses counter operation status from registers in order to 
determine system data operation flow characteristics. 
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It will be appreciated by those of ordinary skill in the art that the invention 
can be embodied in other specific forms without departing from the spirit or 
essential character thereof. The presently <Jis^Gsed embodiments are therefore 
considered in all respects to be illustrative and not restrictive. The scope of the 
invention is indicated by the appended claims rather than the foregoing description, 
and all changes which come within the meaning and range of equivalents thereof are 
intended to be embraced therein. 
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What is claimed is: 

1. A hierarchy of logic configuration storage, the hierarchy consisting 
of at least external storage, on-chip compresseff^acKe, on-chip decompressed 
planes, and one or more active planes. 

2. A method of configuration of an array of computing and/or 
interconnect elements, comprising: 

decoding configuration memory by rows and columns; and 
applying configuration codes to intersections of selected rows and 

columns, including a multiplicity of such intersections for the same 

configuration code. 

3. The method of Claim 2, further comprising overwriting an existing 
configuration code with a new code, allowing the efficient compression of regular 
arrays with differing end conditions such as configurations of different sizes and 
differing contents for items of configuration. 

4. The method of Claim 3, further comprising: 

changing at least some of a computing element's configuration; 
holding fixed at least some of a storage element's configuration; 
whereby data-in-place reconfiguration is achieved. 

5. A reconfigurable computing chip comprising an on-chip configuration 
cache containing a multiplicity of stored configurations, wherein each configuration 
is identified by a unique off-chip address used to fetch that configuration. 

6. The apparatus of Claim 5, where each configuration is compressed. 
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7. The apparatus of Claim 5, where the identification of the addresses is 
perfonned using ccntents-addressable memory. 

8. In a reconfigurable computing system including a reconfigurable 
computing chip having reconfigurable logic and multiple configuration planes, the 
system further including off-chip storage, a method of configuration management, 
comprising storing in a table a current state of the on-chip configurations, said table 
consisting of multiple entries, each entry identifying an on-chip configuration plane 
and identifying a unique off-chip address of a loaded configuration. 

9. The method of Claim 8, further comprising: 

saving the table, thereby saving the entire state of the reconfigurable 

logic; 

loading the table; and 

from information stored in the table, loading the identified 
configurations into the identified on-chip planes. 

10. The method of Claim 8, further comprising establishing initial boot 
conditions in the reconfigurable logic by: 

specifying in the table an address of a boot configuration; and 
automatically loading the boot configuration into the reconfigurable 
computing chip on boot up. 
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ABSTRACT OF THE DISCLOSURE 

The present invention, generally speaki^, provides a hierarchy of 
configuration storage. The highest level of the hierarchy is an active configuration 
store; the lowest level is an off-chip configuration store; in between are one or more 
levels of configuration stores. Every configuration is promoted from the lowest 
off-chip level, through each level, up to the highest active level. Each ascending 
level of the hierarchy has a decreasing latency time required to promote a 
configuration to the next higher level of the hierarchy, and a decreasing amount of 
available storage. This separation into levels allows the amount of available storage 
to be adjusted depending on the inherent latency of the level's storage mechanism, 
where a longer latency requires a larger cache. This in turn allows the total required 
storage for a given performance level to be minimized. 
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